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Broadcast Channels with Delayed Finite-Rate 
Feedback: Predict or Observe? 

Jiaming Xu, Jeffrey G. Andrews, Syed A. Jafar 

Abstract 

Most multiuser precoding techniques require accurate transmitter channel state information (CSIT) to 
maintain orthogonality between the users. Such techniques have proven quite fragile in time-varying channels 
because the CSIT is inherently imperfect due to estimation and feedback delay, as well quantization noise. An 
alternative approach recently proposed by Maddah-Ali and Tse (MAT) allows for significant multiplexing gain 
in the multi-input single-output (MISO) broadcast channel (BC) even with transmit CSIT that is completely 
stale, i.e. uncorrected with the current channel state. With K users, their scheme claims to lose only a log(.K') 
factor relative to the full K degrees of freedom (DoF) attainable in the MISO BC with perfect CSIT for large K. 
However, their result does not consider the cost of the feedback, which is potentially very large in high mobility 
(short channel coherence time). In this paper, we more closely examine the MAT scheme and compare its DoF 
gain to single user transmission (which always achieves 1 DoF) and partial CSIT linear precoding (which 
achieves up to K). In particular, assuming the channel coherence time is N symbol periods and the feedback 
delay is we show that when N < (1 + o(l))K\ogK (short coherence time), single user transmission 
performs best, whereas for N > (1 + o(l))(iVjfd + K/logK)(l — log -1 K)^ 1 (long coherence time), zero- 
forcing precoding outperforms the other two. The MAT scheme is optimal for intermediate coherence times, 
which for practical parameter choices is indeed quite a large and significant range, even accounting for the 
feedback cost. 

I. Introduction 

Interference is a key bottleneck in wireless networks and sophisticated interference reduction tech- 
niques such as multiuser MIMO |0Q, GO, interference alignment (IA) and network MIMO [4] are of 
great interest to researchers and industry. While these techniques in theory offer substantial multiplexing 
gains, meaning they support more effectively interference-free streams, they typically require highly 
accurate transmitter channel state information (CSIT) to achieve said gains. For example, in the case 
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of the broadcast channel with M transmit antennas and K > M users, the optimal multiplexing gain 
with perfect CSIT and CSIR is M, which can be achieved by either dirty paper coding (DPC) flU, 
0, (61 or even by suboptimal linear precoders including zero-forcing (ZF) [7|. Without any CSIT, 
however, the multiplexing gain collapses to 1 for i.i.d. channels Q] or the more general isotropic 
channel M- 

Real systems necessarily have imperfect CSI, particularly at the transmitter. Because the channel 
state must be estimated (usually from noisy training symbols), then quantized to a finite-rate value, 
and finally fedback over a noisy channel in a specified periodic time-slot, the transmitter will have a 
delayed and noisy estimate of the actual channel statej. In essence, multiuser precoders are relying 
on a channel prediction in their attempt to separate the users into interference-free channels. In this 
paper, we only consider the loss due to the delay and finite-rate quantization. 

A. Background and Motivation 

The imperfect CSIT issue is by now quite well understood by both academia and industry. Simple 
multiuser MIMO techniques such as ZF precoding with limited CSI feedback |j9~l- lfl"3l have been 
studied extensively and also implemented in 4G cellular systems [14J, lfl"5ll . One theoretical observation 
is that the feedback rate must scale linearly with log 2 snr to maintain the full multiplexing gain 
with partial CSIT [9]. However, even with this feedback rate, the feedback delay will cause serious 
degradation when the feedback delay approaches (or exceeds) the channel coherence time, causing 
multiuser precoding techniques to achieve a lower rate than single user ones due to multiplexing 
gain loss, regardless of the feedback rate f[T6l . Primarily because of this sensitivity, multiuser MIMO 
techniques (also called SDMA) have been largely disappointing in the field and it is widely agreed 
they are only of use at very low mobility (pedestrian speeds at most) IfPTll . IfTHll . 

A clever recent work [|T9l gets around this apparently fundamental delay limitation by instead 
exploiting previous channel observations to increase the multiplexing gain through a novel feedback, 
transmission, and receiver cancellation scheme. This technique, which we term the Maddah-Ali-Tse 
(MAT) scheme, achieves a multiplexing gain in a MISO broadcast channel of rrrr- , j_ which is 

l+ 2 +.. .+ K 

Kj log(K) for large K even with completely outdated CSIT. This is nearly as good as the multiplexing 
gain of K for perfect CSIT precoding schemes. It was subsequently shown in ll20l that a similar 

'Time division duplex (TDD) systems may in principle be able to exploit reciprocity to reduce delay and quantization loss, but such 
losses still exist. Furthermore, the reciprocal approach is usually not viable in current systems since the downlink and uplink are nearly 
independent: the existence of a downlink packet for a user does not imply that the user is also transmitting in the uplink, especially in 
the same frequency coherence band. 
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conclusion holds for the the X and interference channels using a novel retrospective interference 
alignment scheme. The guiding principle behind these results is that with significant feedback delay, 
channel "predictions" are bound to fail, but channel "observations" can be exploited via eavesdropping 
and feedback to retroactively remove interference. 

A potentially important consideration for these two approaches is the resources they consume on 
the feedback channel. Both lfT9l and ll20l assumed a cost-free infinite rate feedback channel. The MAT 
scheme is primarily of interest for mobile scenarios, since with (very) low mobility the conventional 
channel prediction techniques that achieve the full K DoF can be used. Therefore, the MAT scheme 
still inherently requires frequent and accurate channel state feedback, and its main benefit is that it is 
robust to the feedback delay. The goal of this paper is to determine how much gain (if any) is possible 
with an outdated/observed CSIT approach, while correctly accounting for the unavoidable feedback 
channel overhead. 

B. Summary of Main Result 

The main technical contribution of the present paper is to determine the net DoF provided by the 
MAT scheme lfT9l for a K user MISO broadcast channel. The net DoF is the prelog capacity term 
remaining after subtracting off the feedback DoF consumed (which depends on the feedback rate). 
This can then be compared to two other baseline techniques: (i) the no CSIT single user transmitter 
that always gets 1 DoF and does not require feedback, and (ii) the partial CSIT zero-forcing precoder 
that gets up to the full K DoF when the CSIT is sufficiently current and accurate. Although many 
other techniques could be chosen and compared to MAT, these two provide an instructive baseline 
comparison, and we conjecture that other single user and multiuser precoding schemes would result 
in a very similar tradeoff. 

We start with the K = 2 user MAT scheme, proceed to K = 3 and finally provide a general result 
for all K. The approach leading to the net DoF result proceeds in two steps in each case. First, we 
derive the multiplexing gain (DoF) with finite rate feedback as a function of a parameter a > 0, where 
a — > is bounded rate CSIT and a — > oo is perfect CSIT. The DoF loss relative to what is reported 
in [fT9l is zero as long as the feedback amount is sufficiently large. For example, for K — 2 one 
requires a > 1 to achieve the | DoF that the MAT scheme promises. Second, we determine how the 
sum feedback overhead increases in units of DoF as a function of a and subtract that from the DoF 
gain found in step 1. 

Interestingly, we find that there are regimes where each of the three techniques is the best one, 
where the regimes are defined by the coherence time iV and feedback delay N id in units of symbol 
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times. Specifically, when the coherence time N < (l + o(l))KlogK, there is no value of a that allows 
a net increase in DoF from the MAT scheme, i.e. the net DoF is strictly less than 1 for any K, which 
can be achieved by single-user transmission. Furthermore, when N > (1 + o(l))(N id + Kj logfT)(l — 
log -1 K)~ x , there is no value of a that allows the MAT scheme to outperform ZF precoding. However, 
the MAT scheme does provide a net increase in DoF for the optimal value of a in between these two 
extremes. The main result and this tradeoff is summarized in Fig. [21 

The intuition behind this result is straightforward. The feedback rate for the MAT scheme must 
be held low in order to not overwhelm the forward direction rate gain. But for a low feedback rate, 
the resulting channel quantization error becomes large and the MAT scheme fails to work well. This 
primarily applies to the high mobility (short coherence time) scenario, since feedback in that case must 
be frequent. For sufficiently long coherence times, the feedback delay problem recedes and eventually 
the conventional orthogonalizing precoders became viable, which approach and eventually achieve 
the full K DoF. The MAT scheme fills a useful niche for moderate mobility/coherence times, which 
appears to be a quite broad and relevant regime for reasonable parameter choices. For example, using 
a standard LTE air interface with four transmit antennas and a carrier frequency of 2.1GHz, we find 
that the MAT scheme is preferable to ZF and single user transmission for velocities ranging from 
about 27 km/hr up to airplane-type speeds. 

II. System Model 

A MISO broadcast channel with M transmit antennas and K single antenna receivers is considered. 
In this paper, we assume M = K for simplicity^. In a flat fading environment, this channel can be 
modeled as 

y r [t]=h* r [t}x[t}+z r [tl r = l,...,K, (1) 

where y r [t) is the received signal of receiver r at symbol time t, x[i] G C Mxl is the transmit signal with 
the average power constraint E[x*[i]x[£]] < P, and z r [t) ~ £/V(0, 1) is the additive white Gaussian 
noise. The channel state vector of receiver r is denoted by h*[t] G C lxM and the channel state matrix is 
defined as H[i] = [hi[i], . . . , [£]]. The channel is assumed to be block fading: H[i] remains constant 
over a block of N symbols, and is comprised of i.i.d. unit variance complex Gaussian random variables 
for each block. It follows that the H[t] is full rank with probability 1. 

2 Diversity gain can be achieved when K > M (user selection) or M > K (antenna selection), but those only affect the SNR and we 
are interested in the multiplexing gain. Furthermore, when K 3> M, the opportunistic beamforming will achieve nearly optimal degrees 
of freedom M with small feedback overhead [21], but K needs to grow with SNR which is not assumed in this paper. 
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We consider a delayed finite-rate feedback model. Each receiver is assumed to have an instantaneous 
and perfect knowledge of its own channel vector h r [t]. It then quantizes its channel vector to Q bits 
and feeds back the bits perfectly to the BS with delay of N {d symbols. Notice that we also assume that 
the receivers obtain the channel state of all other receivers via broadcasting in the forward channel 
from BS. 

The channel state quantization is performed using a fixed vector quantization codebook that is known 
to the transmitter and all receivers. The codebook C consists of 2 Q M-dimensional unit norm vectors: 
C = {wi, . . . , w 2 q}. The receiver quantizes its channel vector to the closest quantization vector, i.e., 
the quantization index at time t is 

q[t] = arg min sin 2 (Z (h r [t\, w^)) , 

j=l,...,2<3 

Note that only the direction of the channel vector is quantized and fed back, and no information 
regarding the channel magnitude is conveyed to the transmitter. Also, in this paper, we consider the 
optimal codebook over any vector quantization codebook. 

The key performance metrics considered in this paper are the degrees of freedom (also known as 
the multiplexing gain), the feedback overhead and the net DoF. Let R(P) denote the total average 
throughput with transmit power P. The multiplexing gain with K receivers is defined as 

T>o¥(K) : = lim (2) 

V ' P^oo log 2 P 

The DoF is the prelog of the capacity, and is the number of equivalent channels that carry rate log 2 P 
at high SNR. Let F(P) denote the total feedback rate, then the feedback overhead with K receivers 
is formally defined as 

FB(K) := lim HQ- 

which measures how quickly the feedback rate increases with log 2 P. Finally, we define the net 
multiplexing gain as 

DoF(iT) := DoF(fT) - FB(K). 

The net DoF makes explicit the feedback cost, which is quite important when comparing approaches 
that require differing amounts of feedback, as in this paper. 



III. BACKGROUND 

In this section, for clarity, we briefly summarize two key previous results, and introduce the 
traditional zero-forcing as well as modifying the MAT scheme to use finite rate feedback. 
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A. Multiplexing Gains with Outdated CSIT but no Quantization Error 

The DoF with outdated CSIT but no quantization error was given by the following Theorem [fT9l . 

Theorem 1. 4791/ The optimal multiplexing gain with outdated CSIT is 

DoF*(K) = * (3) 

' 2 K 

The optimal DoF is achieved by the MAT scheme which was also introduced in |[T9l , which works 
as follows. The information symbols intended for a particular receiver can be overheard by other 
receivers. Even with outdated CSIT fed back by the receiver, the transmitter can exploit this overheard 
side information to create future transmissions which are simultaneously useful for more than one 
receiver. However, the result in [fl9l implicitly assumes the feedback is free. 

B. Optimal Vector Quantization 

We now briefly review some basic results on optimal vector quantization in MISO broadcast channel 
from BH, [|22|-[[24l. Let e denote the quantization of the channel vector h and 9 denote the angle 
between h and e. For the optimal vector codebook, the lower and upper bound to the expected value 
of quantization error are given by [|9] 

M 1 Q r r, t Q 

— 2-—^ < E H [sin 2 9] < 2"*=i. (4) 
Thus, the bounds are tight when the number of transmit antennas M is large. 

C. Zero Forcing with Delayed Finite-Rate Feedback 

In the slow fading scenario where the feedback delay iVf d is smaller than the coherence time N, 
channel prediction schemes can be adopted. In this paper, the zero-forcing scheme (ZF) is considered, 
since multiplexing gain is of interest and the zero-forcing scheme can achieve the full K multiplexing 
gain with perfect CSIT Q, [0. The ZF scheme proceeds as follows: at the beginning of each 
block, each user quantizes its own channel vector to B bits and feedbacks the quantization index. 
The transmitter receives the feedback with delay Nfd and uses zero-forcing precoding based on the 
quantized and delayed CSI over the remaining iV — Nj d symbol times. According to the analysis in 
[|9l , if the quantization rate Q is scaled as Q = a(K — 1) log 2 P for a > 0, the DoF under the ZF 
scheme is 

DoF ZF (K) = (l - ^ j (a A 1)K, (5) 
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and the feedback overhead is aK ^ — . Therefore, the net DoF is 



6sp mk) =K (i _ , (6) 

which has a maximum value K(l — Affd+ ^ r ~ 1 ^ ) with a = 1. 

D. Exploiting Outdated CSIT via Delayed Finite-Rate Feedback 

In this subsection, we consider the MAT scheme with finite rate CSI feedback. Since the interference- 
limited case and multiplexing gain are of interest, Gaussian noise is omitted for simplicity. Also, the 
feedback delay is assumed to be identical to the coherence time in the description, i.e., Nm = N, but 
the MAT scheme can be extended to the general case. Let u r and v r denote the symbols from two 
independently encoded Gaussian codewords intended for receiver r. The transmission scheme consists 
of two phases, which take three symbol times over three consecutive blocks in total as shown in Fig.Q] 

Phase one: Feeding the Receivers. This phase has two symbol times. In the first symbol times of 
block 1, the BS transmits the two symbols, ua and va, intended for the receiver A, i.e., X[l] = [ua, va} 1 - 
At the receivers, we have 

VaW\ = h A1 [l)u A + h* A2 [l)v A ■= L A (u A ,v A )[l], 
y B [l] = h* B1 [l]u A + h* m [l]vA ■= L b (ua,v a )[1]. 

At the same time, receiver B measures the channel and obtains perfect knowledge of its channel vector. 
It then quantizes the channel vector and feeds back the quantization index. Let e B [l] = [eeifl], es2[l]]* 
denote the quantized channel vector of receiver B. 

Similarly, in the second symbol times of the block 2, the BS transmits the two symbols, ub and 
vb, intended for the receiver B, i.e., X[2] = [ub,vbY- At the receivers, we have 

y A [2] = h* M [2]u B + h* A2 [2}v B := L A (u B ,v B )[2], 
y B [2\ = h* m [2]u B + h* B2 [2]v B := L B (u B ,v B )[2]. 

At the same time, receiver A feeds back its quantized channel vector e^[2] = [eAi[2], e^2[2]]*. 

A key observation is that because the channel matrix H[l] is full rank with probability 1, if the 
receiver A has the equation L b (ua, overheard by the receiver B, then it has enough equations 

to solve for its own symbols ua and va- The same story holds for receiver B. Therefore, the goal of 
the second phase is to swap these two overheard equations through the transmitter. 

Phase two: Swapping the overheard Equations. This phase has one symbol time in the block 3. 
Since we assume the feedback delay is one block duration, the quantized channel vectors at symbol 



s 



time 1 and 2 are available at the transmitter. Thus, the BS can transmit Lb(u a , v a )[1) + L a (u b , v b )[2], 
i.e., X[3] = [L B {u A ,v A )[l] + L A (u B , v B )[2], 0]*, where L B (u A , v a )[1] = e* B1 [l}u A + e* B2 [l]v A and 
Aa(ub,u.b)[2] = e* A1 [2]u B + e A2 [2]f B . The receiver observes 



Va[3\ = h* M [3] [L B (u A ,v A )[l] + L A (u B} v B )[2] 
2/b[3] = ^bi[3] [L B (u A ,v A )[l] + L A (u B ,v B )[2] 



Va[1] 




Va[2] 




va[3] 





u A 
va 



+ 





h* A1 [2] h* A2 [2) 
^i[3]e A1 [2] h* A1 [3}e A2 [2] 



u B 
vb 



(7) 



u A 




Mb 






va 







(8) 



The transmission scheme is summarized in Fig. [T] Putting all these received equations together in 
matrix form, for receiver A we have 

^i[l] h* A2 [l] 

h A1 {3]e B1 [l] ^ A1 [3]e* 2 [l] 
rewritten in a simpler form as 

Va[1] 
yA[2]/|h A [2]| 
y A [3]/\h A1 [3}\ 

where I* B denotes the interference from transmitting symbols intended for receiver B. This equation has 
an interference alignment interpretation. The received signal y A lies in the 3-dimensional vector space. 
The intended symbols w A and v A of receiver A are sent along the vectors [/i A1 [l], 0, fr A i[3]e B1 [l]] and 
[/i A2 [l]> 0, /i A1 [3]e B2 [l]]* respectively. The unintended symbols u B and v B are sent along the vectors 
[O,^^,^^^^]' and [O^^Mh^e^f respectively. 

If the quantization error is 0, then the quantized channel vector e A [2] is in the same direction as 
the true channel vector h A [2] and thus u B and vb are sent along the same direction. Therefore, by 
zero-forcing the interference, the receiver A has 2 interference-free dimensions. Also, due to the CSI 
broadcast from BS, the receiver A knows the quantized channel vector e B [l) of receiver B. Therefore, 
the receiver A can recover its intended symbols u A and v A by solving two independent linear equations. 
Similarly, for the receiver B, it also has 2 interference-free dimensions and can recover its intended 
symbols u B and v B . All in all, we recover 4 symbols using 3 symbol times and thus | degrees of 
freedom is achieved. 

However, it is easy to see that with finite rate feedback, the quantization error is in general nonzero 
and thus less than | degrees of freedom will be achieved in general. 
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IV. Main Results 



In this section, the impact on the DoF of accounting for finite rate feedback is investigated for the 
MAT scheme, and then compared to single user and ZF transmission. We first start with the two user 
case, then move to three users and finally generalize to the K user case. The proof techniques are 
essentially the same, but the details of the K = 2 and K = 3 cases are instructive to understand the 
general case. 

A. Two User Case, K = 2 

We first consider the two user case as introduced in Section IIII-DI The following lemma relates the 
multiplexing gain to the singular values of interference matrix 1^. 

Lemma 1. For the two user case, the DoF with the MAT scheme is 



where a 2 > is the second largest singular value of I* B . 
Proof: See Appendix \M 

■ 

From Lemma \T\ we can derive the following theorem which relates the multiplexing gain to the 
quantization accuracy Q, which is the feedback rate per user per feedback interval. 

Theorem 2. For the two user case, if the quantization rate Q is scaled as Q = a log 2 P for any 
a > 0, the multiplexing gain under the MAT scheme is 2 ^°^ a |. 

Proof: See Appendix EJ 

■ 

In Theorem [21 we require a > in order to ensure that Q — > oo as P — > oo. However, if a — > 
then the asymptotic multiplexing gain is | in the bounded Q case. Therefore we conclude that for 
all a the multiplexing gain is between | and |. When a > 1, the full multiplexing gain | can be 
achieved, which recovers the result in [fl9l . Finally, we note that when a < |, the multiplexing gain 
becomes less than 1 which is less than the no CSIT case (i.e. single user transmission). 

The following lemma characterizes the feedback overhead required by the MAT scheme. 

Lemma 2. For the two user case, if the quantization rate Q is scaled as Q = a log 2 P for a > 0, 
then the feedback overhead under the MAT scheme is -^ba. 




(9) 
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Proof: From the transmission scheme, it can be seen that receiver A has to feedback its channel 
vector at symbol time 2 of the block 2 to align the interference; while receiver B has to feedback 
its channel vector at symbol time 1 of the block 1 to align the interference. Therefore, over total 3 
blocks, 2 channel vectors must be fedback. It follows then the aggregate feedback rate is log 2 P 



bits/slot and thus the feedback overhead is nf^a. 

Combining Theorem |2] and Lemma |2] gives the net DoF as 

DoF MAT (ir = 2) = DoF(K) - FB(AT) 



§((1 + ^<.)A(2-£)|, (10) 



which has a maximum of 



2(2AT-1) 



37V " 

For comparison, let us consider the single user transmission (SISO) and zero-forcing scheme. For 
single user transmission, it is easy to see that the net DoF is 1, since no CSI feedback is needed. For 
zero-forcing, according to ©, the maximum net DoF with a = 1 is 

DoF ZF (K = 2) = 2(l- . (11) 



N 

When N < 2, the maximum net DoF with MAT is no greater than 1 and thus the MAT scheme 
cannot provide a net gain in DoF compared to SISO for K = 2 and any value of a. When iV > 3iVf d +2, 
the maximum net DoF with MAT is less than that with ZF, and thus the MAT scheme cannot provide 
a net gain for K = 2 and any value of a either. However, when 2 < iV < 3iVfd + 2, the MAT scheme 
can provide a net gain in DoF compared to SISO and ZF for K = 2 with optimal value of a = 1. 
Fig. [3] present numerical results of the maximum net DoF for the three techniques respectively. The 
feedback delay is assumed to be iVfd = 100 for clarity. The actual feedback delay depends on the 
system design. For example, in a narrowband channel with symbol rate 100 KHZ, iVf d = 100 means 
the feedback delay is 1 msec. From the figure, it can be seen that the MAT scheme does provide a 
net DoF gain when the 2 < N < 302, which is a significant range. 

B. Three User Case 

In this subsection, we extend to the three user case. According to [19], the MAT scheme takes 11 
symbol times in 11 consecutive blocks. The received signal of user A over the total 11 symbol times 
can be written in matrix form as 

y A = H> A + I* [u^u^, (12) 
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where 114 is a 6 x 1 vector of symbols intended only for user A, H\ is a 11 x 6 matrix with rank 
6, and I* B c is a 11 x 12 interference matrix. Notice that 1* B c must contain 2 zero row vectors, since 
among the 11 symbol times, 2 symbol times are used for transmitting the symbols only intended for 
user A and thus there is no interference. 

Without quantization error, the rank of 1^ c is 5 and thus the interference lies in a 5-dimensional 
subspace. Therefore, 6 degrees of freedom among 11 symbol times can be achieved by zero forcing 
the interference for each user. With quantization error, the interference will spill out of the 5-dim 
subspace and the zero-forcing scheme cannot eliminate all the interference. Similar to the two user 
case, we have the following lemma, which relating the multiplexing gain with the singular values of 

l B,c'-{ a ii ■ ■ • ^ill- 
Lemma 3. For the three user case, the multiplexing gain under the MAT scheme is 



From Lemma |3] we can derive the following theorem which relates the multiplexing gain to the 
quantization accuracy Q. 

Theorem 3. For the three user case, if the quantization rate Q is scaled as Q = 2a log 2 P for a > 0, 
the multiplexing gain under the MAT scheme is A . 



The feedback overhead is given by the following lemma. 

Lemma 4. For the three user case, if the quantization rate Q is scaled as Q = 2a log 2 P for a > 0, 
then the feedback overhead under the MAT scheme is t^tQz. 

Proof: From the transmission scheme, it can be seen that the key role played by the feedback 
is to ensure the interference in the future transmissions always lies in the 5-dimensional subspace. 
Therefore, each receiver have to feedback its channel vector 5 times to inform the transmitter its 
specific 5-dimensional subspace. Thus, over total UN symbol times, 15 channel vectors must be 
fedback in total. It follows then the total feedback rate is j^a\og 2 P bits/slot. ■ 




(13) 



Proof: See Appendix O 



Proof: See Appendix O 
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Combining Theorem |3] and Lemma |4] gives the net DoF as 

DoF MAT (ir = 3) = DoF(fT) — FB(K) 



= n( (1 + ^ a)A(3 -^»)' <14) 

which has a maximum of ^j^r^- 

The maximum net DoF with SISO is still 1 and according to ©, the maximum net DoF with ZF 

for K = 3 is 

DoF ZF (K = 3) = 3 (l - + . (15) 

.30 



When N < y, the maximum net DoF with MAT is no greater than 1 and thus the MAT scheme 

5 



cannot provide a net gain in DoF compared to SISO for K = 3 and any value of a. When iV > 11JVfd+12 



the maximum net DoF with MAT is less than for ZF The useful range therefore is 

30 1L/Vm+12 
7 - - 5 

for which the MAT scheme can achieve a net gain. Fig. @] shows the maximum net DoF for the three 
techniques for a feedback delay of Na = 100. The MAT scheme provides a net DoF gain in this case 
for 5 < N < 222. 

C. General K User Case 

In this subsection, we generalize the previous results to the K user case. Let T K — j- = yP, 
where D,T E J\f. According to [19], the transmission scheme for the K user case takes T symbol 
times in the T consecutive blocks, and the received signal for user A over T symbol times can be 
written in matrix form as 

y A = H* A u A + Y fA vi/ A , (16) 

where is a D x 1 vector of symbols intended for user A, vl/ A is a (K -l)Dxl vector of symbols 
intended for users other than A (the subscript /A means all other users except user A). Yl* A is a T x D 
matrix with rank D, and l* A is a T x (K — 1)D matrix. Notice that, I* A contains zero row vectors, 
since among the T symbol times, slots are used for transmitting D symbols only intended for user 
A. 

Without quantization error, the rank of 1* A is (T — D) and thus the interference lies in a (T — D)- 
dimensional subspace. Therefore, D degrees of freedom for each user can be achieved by zero forcing 
the interference. With quantization error, the interference will spill out of the (T — D) -dimensional 
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subspace and the zero-forcing scheme cannot eliminate all the interference. Similar to the two and 
three user case, we have the following lemma, which relates the multiplexing gain with the singular 
values of I* A : {a 1: . . . , a T }. 

Lemma 5. For the K user case, the multiplexing gain under the MAT scheme is 

W °_ff ( h Mil v 

i= T-D+l v 62 7 

Proof: See Appendix El ■ 
The following theorem generalizes Theorem |2] and Theorem |3] 

Theorem 4. For the K user case, if the quantization rate Q is scaled as Q = a{K — 1) log 2 P for 
a > 0, the multiplexing gain under the MAT scheme is ^ 1+ ^T 1 ^° ! A 1 j DoF*(K). 

Proof: See Appendix IB ■ 
When a > 1, the optimal multiplexing gain without feedback rate constraint DoF*(i^) can be 
achieved; while if a < a* = n ,i_f ,j_L_ ir the multiplexing gain drops to less than 1 and outdated 
CSIT becomes useless. 

The feedback overhead is characterized by the following theorem, which generalizes Lemma |2] and 
Lemma @] 

Lemma 6. For the K user case, if the quantization rate Q is scaled as Q = a(K — 1) log 2 P for 
a > 0, then the feedback overhead with the MAT scheme is -7rn~r 2 + 'L^n " a - 

(1+ 2 +...+ K )1V 

Proof: From the transmission scheme, it can be induced that the key role played by the feedback 
is to ensure the interference for a particular user in future transmissions always lies in a (T — D)- 
dimensional subspace. Therefore, each receiver has to feedback its channel vector (T — D) times to 
inform the transmitter its specific (T — D) -dimensional subspace. Thus, over total NT symbol times, 
K(T — D) channel vectors need feedback. It follows that the feedback rate is i^ 2+ ±Im^ a l°g2 P 
bits/slot. Finally, the theorem follows by invoking the definition of feedback overhead. ■ 
Combining Theorem |4] and Theorem |6] gives the net DoF as 

DoFmatW = DoF(K) - FB(AT) 

N + a{K-l){N-K{\ + ... + ±)) a K{N-a{K-l){\ + . .. + $)) ^ 
(l + I + ... + i)iV + 1 + ^ 

.... . x(iV-(K-l)(i+...+i)) 

which has a maximum of , Jj_-iAf • 

(1+ J +...+ r, jJV 
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The maximum net DoF with SISO is still 1 and according to ©, the maximum net DoF with ZF 

for K = 3 is 

6Tf zf (/ = k (i - + . (19) 

As in the K = 2 and K = 3 case, we can identify a range where the MAT scheme is worth- 
while. When N < ^~ 1) ^ 4 + -; + j ) = (1 + o(l))KlogK, single user SISO is better, while for 

K (l+ 2 +...+ K J 

iV > (i±i±^±i^±^zl = (1 + o(l))(JVfd + K/logiT)(l - log -1 If)" 1 , zero forcing is preferable. 
The MAT scheme is the best for all other values of iV when using the optimal value of a = 1. 

Fig. [2] provides a visual summary of the main results. We fix the number of users K and vary the 
block duration N, and plot the net DoF attained by the MAT, single user transmission (SISO), and 
zero-forcing (ZF). The three different channel coherence time regimes can be immediately observed: 
(i) short coherence time (N < (1 + o(l))KlogK), (ii) moderate coherence time ((1 +o(l))K log K < 
N < (l + o(l))(iV fd + is:/log/0( 1 - 1 og~ 1 K)- 1 ), and (iii) long coherence time (JV > (l + o(l))(JV H + 
K/\ogK){\ - log -1 K)- 1 ). One would prefer to choose SISO for (i), MAT for (ii) and ZF for (iii). 

D. Analog vs. Digital Feedback 

An alternative way to feed back CSI is analog feedback, where each receiver feeds back its channel 
vector h r by explicitly transmitting M complex coefficients over a unfaded additive Gaussian noise 
feedback channel: 

G [t] = VPH [t - N M ] +Z[t], (20) 

where G[t] is the received channel state feedback and Z[t] is the Gaussian noise in feedback channel. 
As H[i] is composed of i.i.d. complex Gaussian with unit variance, the optimal estimator of CSI is 
the MMSE estimator given by 



H[* - iV fd ] = J^pG[tl (21) 

where H[t] is the estimator of true channel state H[t]. Since we are interested in the scaling rate of the 
feedback rate with respect to log 2 P as P — > oo, the estimator noise can be neglected and the CSIT is 
accurate and only subject to feedback delay. Therefore, the DoF with noiseless analog feedback is the 
same as that with accurate CSIT, and the sum feedback overhead is Then, the net DoF is given 
by 

D ° FmAtW = (1 + 1 + ... + ^ ' (22) 
Mzf W =K(l- ■ (23) 
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Compare it to the result with digital (quantized) feedback, we see that the net DoF with analog feedback 
is almost the same as the maximum net DoF (a = 1) with digital feedback. Therefore, the tradeoff 
between coherence time, feedback rate/delay, and the transmission techniques shown in Fig. [2] remains 
the same with analog feedback. Note that the digital feedback appears to be more flexible than analog 
feedback, since it can adjust a to meet the feedback rate constraints and achieve a gradual degradation 
of net DoF while analog feedback is only feasible when feedback overhead ^ is supportable in the 
feedback channel. 

E. Design Guidelines 

This subsection translates the previous analytical results into rough design guidelines for a real-world 
system. To be concrete, we adopt the parameters used in the 3GPP LTE standard lfT5l . The carrier 
frequency is chosen to be f c = 2.1 GHz and resources are allocated to users in "resource blocks" in 
the time-frequency grid consisting of 12 subcarriers, spanning 180 KHz in frequency, over 14 OFDM 
symbols, which spans 1 msec in time. Therefore, a data symbol slot is effectively T s = 1/168 msec 
since there are 12 x 14 = 168 symbols sent in a msec. The typical CSI feedback delay is assumed 
to be an LTE frame, which is 10 msec, so iVf d = 10 x 168 = 1680 symbols. Assuming the standard 
relation between channel coherence time, Doppler spread, and user velocity v, we have v = j ^ T m/s, 
where c is the speed of light. Then, based on the results in the previous section, approximate regimes 
where the MAT scheme achieves a net DOF gain are summarized in Table [Q We caution against taking 
this table too literally, since other factors not modeled in this paper may play a role in the regimes of 
optimality, but nevertheless it appears that the MAT scheme provides a net DoF gain for a very large 
range of mobility. The upper limit is where the mobility is so high that it is better to switch to simple 
single user transmission, whereas the lower bound on velocity represents the crossing point over to 
ZF precoding. We observe that as the number of antennas (and thus users) increases, ZF precoding 
takes on increasing role since its maximum achievable DoF is K vs. MAT's Kj \ogK and the log if 
gap becomes more significant. 

Appendix 

A. Proof of Lemma \B 

Proof: As can be seen in ©, in order to achieve the maximal multiplexing gain, receiver A must 
attempt to zero-force the interference, i.e., 



U* A y A = 


ua 


+ w A r B 


u B 




va 




vb 
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where XJ* A is a 2 x 3 zero-forcing matrix. The rank of XJ* A must be 2 to recover ua and v A - Then, the 
average throughput of receiver A can be derived as 



Ra{P) 



-E 



log; 



det (I 2 + f H^U^U^H^ + f l B \J A U\l B ) 



(25) 



det (l 2 + f I B U A U A P B ) 

Define the singular value decomposition I B = UEV*, where S = diagjoi, a 2 } and XJ* A is chosen by 
canceling as much interference as possible. Using matrix analysis, we can derive 



det ( 1 2 + -I B V A V* A I B 



(a) 
(*>) 



<\<a ( i 2 + ^v* A r B i B v A 



p 



XI) 



(c) 

> 1 



p 



(A 2 + xl) 



® 1 4- P HT TT II 



> 1 + 



P 



-<7. 



2- 



(26) 



where (a): follows from the fact that det (I + AB) = det (I + BA). 

(b) : follows from the definition that Ai, A2 are the singular values of 1 b XJa- 

(c) : follows from neglecting the remaining nonnegative parts Af A 2 . 

(d) : follows from the definition of Frobenius norm || • \\p. 

(e) : follows from the fact that min UA: u^u A =i ||IbUa||f = of- 



Choose XJa to be the last 2 columns of V, i.e., XJ A 
interference power therefore becomes 

P 
2 

Then, by the definition of multiplexing gain in ©, we have DoF A (2) 



V(2 : 3), the lower bound is achieved. The 



det ( i 2 + -i B v A v* A r B 



1 + -al 

2 2 



(27) 



— lim 

3 P-S>00 



E [log 2 det (I 2 + f H A U A U A H A + f l B \J A V\l* B )] E [log 2 det (l 2 + f I B U A U^)] 



1 



lim 

3 P->oo 



(a) 2 

3 

(j 2 _ 1 
3 3 

= I lim 

3 3 Vp^oo 



log 2 P 

Elog 2 det (I 2 + |I B U A U^) 



log 2 P 



lim 

P->oo 



E[log 2 (Pa 2 



loe 9 P 



log 2 P 

2 )1 

^ vo 



E[log 2 (a 2 2 )] 
log 2 P 



V(-l) 



(28) 
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where (a) follows from the fact that the rank of H^Ua is 2 almost surely and that the signal power 
dominates the interference power when P — > oo; (b) follows from (127b and let P — > oo. Finally, the 
proof is completed by considering receiver B similarly. ■ 

B. Proof of Theorem [2] 

Proof: From Lemma [fl the multiplexing gain can be derived as 



2 2 

DoF(2) = I lim E 

3 3 \ p->oo 



log 2 (<T 



log 2 P 



V 



g 2 _ 2 / jgagjga) v (_!) 
3 3 \p->oc log 2 P 



(J 2 _ 2 / log 2 (E [sin 
3 3 I p^o 



log 2 P 



V 



3 3 V p ^°° log 2 P 



A 1 



2 2 

— I — (a A 1) 

3 3 V ; 



(29) 



where (a) follows from Jensen's inequality and log(x) being a concave function, which gives the 
inequality >. Moreover, since P — > oo and thus B — > oo, a 2 converges to almost surely. Therefore, 
the inequality > becomes an equality. 
(b) follows from the fact that 



(To 



cos6 = 9 (sin 2 fl) 



(30) 



which is proved in Appendix [Gj 

(c) follows from the lower and upper bound of E [sin 2 9] in ©. 



C. Proof of Lemma \3\ 

Proof: Focus on receiver A first. Similar to the two user case, the average throughput of receiver 
A is given by 



Ra{P) 



11 



-E 



log; 



det (I 6 + f H^U^ + f I B>C U A U^ C ) 



(3D 



det (I 12 + f I B , c UAU* A I* Bt c) 
Choose U A as the last 6 columns of V: = V(6 : 11), where Ibo — USV* is the singular value 
decomposition. Then, the interference power can be simplified as 



det (l a + jlBfiVA^Bjo) = IK 1 + f = fl(l + ^i) 

^ ' i=6 i=6 



(32) 
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where the last equality follows from the fact that 010 = <7n = since \b,c contains two zero row 
vectors. Then the lemma naturally follows by considering the definition of the multiplexing gain and 
the fact that the rank of H^U^ is 6. ■ 



D. Proof of Theorem \3\ 

Proof: From Lemma |3j we have 



6 3 f 
DoF(3) = — - — V I lim 

V ' 11 11 ^ \P^a. 

4 — R \ 



6 

IT 



i=6 
9 



E( lim 

4 — fi \ 



i=6 
9 



(a) 6 3 

~~ TT ~ IT ^ 



ll 

6 

IT 

6 



+ 



i=6 

12 / B 



lim 

P->oo 



E[log 2 (a, 2 )] 
log 2 P 

log 2 (E[a 2 ]) 
log 2 P 

log 2 (E [sin 2 #)] 



V 



V 



log 2 P 



11 V21og 2 P 



A 1 



H (a A 1 

11 11 V y 



-1) 

v(-i; 



where (a) follows from the fact that 

E[cx 2 ] = (E[sin 2 9]), for i = 6, . . . , 9, 
which is proved in Appendix UU 



(33) 



(34) 



E. Proof of Lemma \5\ 

Proof: Focus on receiver A first. As can be seen in ([Tot , in order to achieve the maximal 
multiplexing gain, receiver A must try to zero-forcing the interference, i.e., 

U* A y A = V* A H.* A u A + ir A r /A u /A , (35) 
where is a D x T zero-forcing matrix of rank D. Then, the average throughput of user A is 

det (i D + £ n A v A v* A n A + p /A UAU A r 



Ra(P) 



T 



-E 



log 2 



<H [ I (K . l)D + £ I /A U A V* A I* /A 



(36) 



Choose U A as the last D columns of V: = V(T — D + l: T), where I/a = USV* is the singular 
value decomposition. Then, the interference power becomes 



det I 



-(A'-l)D 



P 

— I /j4 U A U^I* A 



T 

n 

i=T-£>+l 



T-D/K 

n 

i=T-D+l 



(37) 
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where the last equality follows from the fact that I/ A contains -S zero row vectors. 
Then, by the definition of multiplexing gain in (O, we have 

DoF A (K) = lim Ra{P) 



P->oo log 2 P 

P 

K 



( = } ^-i limE 
T T p^oo 



log 9 det ( + —l/ A \J A \J* A l* A 



{b) D 1 / E[log 2 (Pa 2 )] 



i=T-D+l 
T-D/K 



7 , lim ozv ^ V 



i=T-D+l \ 62 / 

where (a) follows from the fact that H^U^ is of rank D and that the signal power dominates the 
interference power when P — > 00; (b) follows from (|37l ) and let P — > 00. Finally, the proof can be 
readily completed by considering all the users. ■ 

F. Proof of Theorem 

Proof: From Lemma |5l we have 

T T . jf^ . Vp^oo log 2 P v ; 



i=T-D+l 



(0) P 
T 


K 
T 

i 


T-D/K 

E 


(A) 


(K 


-1)P 


T 




T 


(c)D 
T 


(K 


-1)P 




T 


D 

= T + 


(K 


-1)D 




T 



log 2 (E[a ? 2 ]) v( _ 1} 



P^oo log 2 P 

lim log 2 (E[sin^]) 

P->oo log 2 P 

lim ^ A 1 

p^oo (K- l)log 2 P 

(a A 1) , (39) 

where (a) follows from the Jensen's inequality and the fact that log(x) is a concave function: we have 
the inequality >. Moreover, since P — > 00, B — > 00, <jj converges to zero almost surely. Therefore, 
the inequality > becomes equality. 

(b) follows from the fact that 

E[a 2 ] = (E[sin 2 9]), for i = T - D + 1, . . . , T - ^, (40) 

which is proved in Appendix |Hj 

(c) follows from the lower and upper bound of E [sin 2 9] in ©. ■ 
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G. Proof of $M 

Proof: To find the a 2 , it suffices to consider the last two rows of I* B , denoted by I* B . Let us define 
a new 2-dimensional unit norm vector a such that its angles to the two rows of 1^ are the same. 
Also, define a 1 - as the 2-dimensional unit norm vector which is orthogonal to a. Then we have the 
following singular value decomposition of 1^: 



1 1 

1 1_ 

%/2 V2 



V2 



cos 







V2 si 



sm 



a* 



(41) 



Therefore, a\ = 2 sin 2 f = 6 (sin 2 0) . 



H. Proof of $M and (g0 

Proof: It suffices to consider the nonzero rows of interference matrix I* A . Thus, let us remove 
the zero rows and still denote it as I* A for ease of notation. First prove the upper bound of singular 
values of I* A . Denote I* A with no quantization error as 1* A and define E = I* A — 1* A . Let Ej denote 
rows of E for % — 1, . . . , T — j^. 

From the perturbation bounds for the singular values of a matrix due to Weyl [25 J, we have 

D 
K 



aA < \\E\\ F , for i = T - D + 1, . . . ,T 



(42) 



where Oi is the singular value of 1* /A . Since I* /A is of rank T — ^, we have that a, = for i 



/A 



T-D + \,...,T-%. 



K " 



Moreover, E [||Ei|| 2 ] < 4E [sin 2 §] , for i = 1, . . . , T - § . Therefore, E\\E\\ 2 F < 4(T- § )E [sin 2 f]. 
It follows then 



E[a 2 ] < 4(T-^)E 



9 



sm 



0(E [sin 2 ^]) , for 



l,...,T-£. 

' ' AT 



(43) 



Next prove the lower bound, i.e., E 



o% D 



Q (E [sin 2 9] ) . Let Ij denote rows of l*, A , for 



T — ^ and Sj denote the space spanned by all the rows Ii, 



dist(L,Si) := min ||Ii-Xj|| 2 



, It_o other than L. Define 



(44) 



Due to the quantization error, dist(L, Sj) > 2 sin | almost surely when 9 is sufficiently small. Further- 
more, from the negative second moment identity in [26| we have 



J2^ 2 = E dist ft' s * 



-2 



(45) 



8=1 
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Then it follows that 

K 1=1 1=1 

K i T — 2- 

< Y — V" = — -£ L - (46) 
~ ^ 4sin 2 # 4sin 2 # V 

i=i 

Therefore, we have 

a 2 R >^- = n( S in*9). (47) 

K 1 - K 

We conclude that K[af] = (E[sin 2 9}) , for i = T - D + 1, . . . , T - § . ■ 
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Table I 

Approx. range of optimality of the MAT scheme with LTE-like parameters, see Sec. |IV-E1 



Number of antennas K 


Coherence time N 


Coherence time T c (msec) 


Velocity v (km/hr) 


2 


2 < N < 5000 


T c < 30 


v > 17 


4 


7 < N < 3200 


0.04 < T c < 20 


27 < v < 12,000 


16 


46 < N < 2400 


0.3 < T c < 14 


36 < v < 1900 



t=2 t=3 



LJu„v,)l2] |j.;[3](t,(» i ,»,)[l]ti,(.„r,)[i| 




Figure 1. Block diagram of the MAT scheme with quantization error for K — 2. 
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Figure 2. The net DoF for MAT, SISO and multiuser ZF precoding versus the coherence block length N with large K. Note the is 
figure is not to scale (the range of optimality for MAT would be much wider). 
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Figure 3. The net DoF of MAT, SISO and ZF for K — 2 with varying coherence block length TV. 
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Figure 4. The net DoF of the MAT, SISO and ZF for K — 3 with varying coherence block length N. 



